Lamar County
Merging Embedded Topics with Optimal Transport for Online Topic Modeling on Data Streams
Granese, Federica, Navet, Benjamin, Villata, Serena, Bouveyron, Charles
Topic modeling is a key component in unsupervised learning, employed to identify topics within a corpus of textual data. The rapid growth of social media generates an ever-growing volume of textual data daily, making online topic modeling methods essential for managing these data streams that continuously arrive over time. This paper introduces a novel approach to online topic modeling named StreamETM. This approach builds on the Embedded Topic Model (ETM) to handle data streams by merging models learned on consecutive partial document batches using unbalanced optimal transport. Additionally, an online change point detection algorithm is employed to identify shifts in topics over time, enabling the identification of significant changes in the dynamics of text streams. Numerical experiments on simulated and real-world data show StreamETM outperforming competitors.
- Asia > Middle East > Israel (0.05)
- Asia > Middle East > Republic of Türkiye (0.05)
- Europe > France > Provence-Alpes-Côte d'Azur (0.04)
- (5 more...)
- Leisure & Entertainment > Sports > Baseball (0.46)
- Information Technology > Security & Privacy (0.46)
Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Nagashima, Shunya, Sugiura, Komei
Accurate, reliable solar flare prediction is crucial for mitigating potential disruptions to critical infrastructure, while predicting solar flares remains a significant challenge. Existing methods based on heuristic physical features often lack representation learning from solar images. On the other hand, end-to-end learning approaches struggle to model long-range temporal dependencies in solar images. In this study, we propose Deep Space Weather Model (Deep SWM), which is based on multiple deep state space models for handling both ten-channel solar images and long-range spatio-temporal dependencies. Deep SWM also features a sparse masked autoencoder, a novel pretraining strategy that employs a two-phase masking approach to preserve crucial regions such as sunspots while compressing spatial information. Furthermore, we built FlareBench, a new public benchmark for solar flare prediction covering a full 11-year solar activity cycle, to validate our method. Our method outperformed baseline methods and even human expert performance on standard metrics in terms of performance and reliability. The project page can be found at https://keio-smilab25.github.io/DeepSWM.
- Asia > Japan (0.40)
- North America > Canada > Ontario > Toronto (0.04)
- North America > United States > Colorado (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
The TESS Ten Thousand Catalog: 10,001 uniformly-vetted and -validated Eclipsing Binary Stars detected in Full-Frame Image data by machine learning and analyzed by citizen scientists
Kostov, Veselin B., Powell, Brian P., Fornear, Aline U., Di Fraia, Marco Z., Gagliano, Robert, Jacobs, Thomas L., de Lambilly, Julien S., Luca, Hugo A. Durantini, Majewski, Steven R., Omohundro, Mark, Orosz, Jerome, Rappaport, Saul A., Salik, Ryan, Short, Donald, Welsh, William, Alexandrov, Svetoslav, da Silva, Cledison Marcos, Dunning, Erika, Guhne, Gerd, Huten, Marc, Hyogo, Michiharu, Iannone, Davide, Lee, Sam, Magliano, Christian, Sharma, Manya, Tarr, Allan, Yablonsky, John, Acharya, Sovan, Adams, Fred, Barclay, Thomas, Montet, Benjamin T., Mullally, Susan, Olmschenk, Greg, Prsa, Andrej, Quintana, Elisa, Wilson, Robert, Balcioglu, Hasret, Kruse, Ethan, Collaboration, the Eclipsing Binary Patrol
The Transiting Exoplanet Survey Satellite (TESS) has surveyed nearly the entire sky in Full-Frame Image mode with a time resolution of 200 seconds to 30 minutes and a temporal baseline of at least 27 days. In addition to the primary goal of discovering new exoplanets, TESS is exceptionally capable at detecting variable stars, and in particular short-period eclipsing binaries which are relatively common, making up a few percent of all stars, and represent powerful astrophysical laboratories for deep investigations of stellar formation and evolution. We combed Sectors 1-82 of TESS Full-Frame Image data searching for eclipsing binary stars using a neural network that identified ~1.2 million stars with eclipse-like features. Of these, we have performed an in-depth analysis on ~60,000 targets using automated methods and manual inspection by citizen scientists. Here we present a catalog of 10001 uniformly-vetted and -validated eclipsing binary stars that passed all our ephemeris and photocenter tests, as well as complementary visual inspection. Of these, 7936 are new eclipsing binaries while the remaining 2065 are known systems for which we update the published ephemerides. We outline the detection and analysis of the targets, discuss the properties of the sample, and highlight potentially interesting systems. Finally, we also provide a list of ~900,000 unvetted and unvalidated targets for which the neural network found eclipse-like features with a score higher than 0.9, and for which there are no known eclipsing binaries within a sky-projected separation of a TESS pixel (~21 arcsec).
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
- North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
- North America > United States > Gulf of Mexico > Central GOM (0.04)
- (13 more...)
- Government > Space Agency (0.46)
- Government > Regional Government (0.46)
Adaptive Detection of Fast Moving Celestial Objects Using a Mixture of Experts and Physical-Inspired Neural Network
Jia, Peng, Li, Ge, Cheng, Bafeng, Li, Yushan, Sun, Rongyu
Fast moving celestial objects are characterized by velocities across the celestial sphere that significantly differ from the motions of background stars. In observational images, these objects exhibit distinct shapes, contrasting with the typical appearances of stars. Depending on the observational method employed, these celestial entities may be designated as near-Earth objects or asteroids. Historically, fast moving celestial objects have been observed using ground-based telescopes, where the relative stability of stars and Earth facilitated effective image differencing techniques alongside traditional fast moving celestial object detection and classification algorithms. However, the growing prevalence of space-based telescopes, along with their diverse observational modes, produces images with different properties, rendering conventional methods less effective. This paper presents a novel algorithm for detecting fast moving celestial objects within star fields. Our approach enhances state-of-the-art fast moving celestial object detection neural networks by transforming them into physical-inspired neural networks. These neural networks leverage the point spread function of the telescope and the specific observational mode as prior information; they can directly identify moving fast moving celestial objects within star fields without requiring additional training, thereby addressing the limitations of traditional techniques. Additionally, all neural networks are integrated using the mixture of experts technique, forming a comprehensive fast moving celestial object detection algorithm. We have evaluated our algorithm using simulated observational data that mimics various observations carried out by space based telescope scenarios and real observation images. Results demonstrate that our method effectively detects fast moving celestial objects across different observational modes.
- North America > United States > Alabama > Lamar County (0.45)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlearning
Moon, Saemi, Lee, Minjong, Park, Sangdon, Kim, Dongwoo
As text-to-image diffusion models become advanced enough for commercial applications, there is also increasing concern about their potential for malicious and harmful use. Model unlearning has been proposed to mitigate the concerns by removing undesired and potentially harmful information from the pre-trained model. So far, the success of unlearning is mainly measured by whether the unlearned model can generate a target concept while maintaining image quality. However, unlearning is typically tested under limited scenarios, and the side effects of unlearning have barely been studied in the current literature. In this work, we thoroughly analyze unlearning under various scenarios with five key aspects. Our investigation reveals that every method has side effects or limitations, especially in more complex and realistic situations. By releasing our comprehensive evaluation framework with the source codes and artifacts, we hope to inspire further research in this area, leading to more reliable and effective unlearning methods.
- North America > United States > Texas > Stonewall County (0.04)
- North America > United States > Alabama > Lamar County (0.04)
- Transportation (0.70)
- Information Technology (0.48)
Analysis: How Russia, Ukraine's militaries stack up after two years of war
Ukraine has been fighting Russia for two years to liberate its lands and drive Russia back – but supply, tactics and the flat terrain have meant that the much-vaunted Ukrainian counteroffensive of last year has produced few tangible results. In the wide-open agricultural land of southern Ukraine, there is not much in the way of cover for an attacking force. Russia had months to prepare its defences, and built them in depth. Row after row of trenches, anti-tank obstacles, ditches and reinforced bunkers have formed a barrier, often kilometres deep, effectively containing Ukrainian forces as they have repeatedly tried to break through into the open country beyond, with little success. The counteroffensive has bogged down into slow, attritional warfare, as Russia's strategy of making Ukraine pay for every metre it tries to take is showing signs of succeeding.
- Europe > Ukraine (1.00)
- Asia > Russia (1.00)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.06)
- (4 more...)
- Government > Military (1.00)
- Government > Regional Government > Europe Government (0.35)
Nano: Nested Human-in-the-Loop Reward Learning for Few-shot Language Model Control
Fan, Xiang, Lyu, Yiwei, Liang, Paul Pu, Salakhutdinov, Ruslan, Morency, Louis-Philippe
Pretrained language models have demonstrated extraordinary capabilities in language generation. However, real-world tasks often require controlling the distribution of generated text in order to mitigate bias, promote fairness, and achieve personalization. Existing techniques for controlling the distribution of generated text only work with quantified distributions, which require pre-defined categories, proportions of the distribution, or an existing corpus following the desired distributions. However, many important distributions, such as personal preferences, are unquantified. In this work, we tackle the problem of generating text following arbitrary distributions (quantified and unquantified) by proposing Nano, a few-shot human-in-the-loop training algorithm that continuously learns from human feedback. Nano achieves state-of-the-art results on single topic/attribute as well as quantified distribution control compared to previous works. We also show that Nano is able to learn unquantified distributions, achieves personalization, and captures differences between different individuals' personal preferences with high sample efficiency.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Iowa (0.04)
- North America > United States > California > San Francisco County > San Francisco (0.04)
- (17 more...)
- Media (1.00)
- Leisure & Entertainment (1.00)
- Health & Medicine (1.00)
- (3 more...)
High Frequency, High Accuracy Pointing onboard Nanosats using Neuromorphic Event Sensing and Piezoelectric Actuation
Latif, Yasir, Anastasiou, Peter, Ng, Yonhon, Prime, Zebb, Lu, Tien-Fu, Tetlow, Matthew, Mahony, Robert, Chin, Tat-Jun
As satellites become smaller, the ability to maintain stable pointing decreases as external forces acting on the satellite come into play. At the same time, reaction wheels used in the attitude determination and control system (ADCS) introduce high frequency jitter which can disrupt pointing stability. For space domain awareness (SDA) tasks that track objects tens of thousands of kilometres away, the pointing accuracy offered by current nanosats, typically in the range of 10 to 100 arcseconds, is not sufficient. In this work, we develop a novel payload that utilises a neuromorphic event sensor (for high frequency and highly accurate relative attitude estimation) paired in a closed loop with a piezoelectric stage (for active attitude corrections) to provide highly stable sensor-specific pointing. Event sensors are especially suited for space applications due to their desirable characteristics of low power consumption, asynchronous operation, and high dynamic range. We use the event sensor to first estimate a reference background star field from which instantaneous relative attitude is estimated at high frequency. The piezoelectric stage works in a closed control loop with the event sensor to perform attitude corrections based on the discrepancy between the current and desired attitude. Results in a controlled setting show that we can achieve a pointing accuracy in the range of 1-5 arcseconds using our novel payload at an operating frequency of up to 50Hz using a prototype built from commercial-off-the-shelf components. Further details can be found at https://ylatif.github.io/ultrafinestabilisation
- North America > United States > Alabama > Lamar County (0.25)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Variational Inference for Deblending Crowded Starfields
Liu, Runjing, McAuliffe, Jon D., Regier, Jeffrey
In images collected by astronomical surveys, stars and galaxies often overlap visually. Deblending is the task of distinguishing and characterizing individual light sources in survey images. We propose StarNet, a Bayesian method to deblend sources in astronomical images of crowded star fields. StarNet leverages recent advances in variational inference, including amortized variational distributions and an optimization objective targeting an expectation of the forward KL divergence. In our experiments with SDSS images of the M2 globular cluster, StarNet is substantially more accurate than two competing methods: Probabilistic Cataloging (PCAT), a method that uses MCMC for inference, and DAOPHOT, a software pipeline employed by SDSS for deblending. In addition, the amortized approach to inference gives StarNet the scaling characteristics necessary to perform Bayesian inference on modern astronomical surveys.
- North America > United States > Alabama > Lamar County (0.24)
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (4 more...)
A Novel Application of Conditional Normalizing Flows: Stellar Age Inference with Gyrochronology
Van-Lane, Phil, Speagle, Joshua S., Douglas, Stephanie
Stellar ages are critical building blocks of evolutionary models, but challenging to measure for low mass main sequence stars. An unexplored solution in this regime is the application of probabilistic machine learning methods to gyrochronology, a stellar dating technique that is uniquely well suited for these stars. While accurate analytical gyrochronological models have proven challenging to develop, here we apply conditional normalizing flows to photometric data from open star clusters, and demonstrate that a data-driven approach can constrain gyrochronological ages with a precision comparable to other standard techniques. We evaluate the flow results in the context of a Bayesian framework, and show that our inferred ages recover literature values well. This work demonstrates the potential of a probabilistic data-driven solution to widen the applicability of gyrochronological stellar dating.
- North America > Canada > Ontario > Toronto (0.15)
- North America > United States > Alabama > Lamar County (0.05)
- North America > United States > Pennsylvania > Northampton County > Easton (0.04)
- (3 more...)
- Research Report (0.64)
- Overview > Innovation (0.40)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.48)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)